You've learnt about all the core aspects of data exploration, data cleaning, and data visualisation. Download, unzip and open the notebook I've included for this assignment.
You'll find an incredibly rich dataset from nextspaceflight.com that includes all the space missions since the beginning of Space Race between the USA and the Soviet Union in 1957! It has data on the mission status (success/failure), the cost of the mission, the number of launches per country, and much much more. There's so much we can learn from this dataset about the dominant organisations and the trends over time. For example:
Who launched the most missions in any given year?
How has the cost of a space mission varied over time?
Which months are the most popular for launches?
Have space missions gotten safer or has the chance of failure remained unchanged?
I'm sure that you'll discover many more questions that you can formulate and answer with this dataset! Use it to practice what you learnt about creating various types of charts and visualisations, from choropleths to sunburst charts to segmented bar charts and see if you can turn data into insight. Good luck!
Reflection Time:
This is a place to journal your experience of completing this project. This will help you figure out how to improve as a developer.
Write down how you approached the project. What was hard, what was easy. How might you improve for the next project? What was your biggest learning from today? What would you do differently if you were to tackle this project again?
Reflection Time:
This is a place to journal your experience of completing this project. This will help you figure out how to improve as a developer.
Write down how you approached the project. What was hard, what was easy. How might you improve for the next project? What was your biggest learning from today? What would you do differently if you were to tackle this project again?
sklsbjso
Give Feedback
What went well? What could be improved?
Reflection Time:
This is a place to journal your experience of completing this project. This will help you figure out how to improve as a developer.
Write down how you approached the project. What was hard, what was easy. How might you improve for the next project? What was your biggest learning from today? What would you do differently if you were to tackle this project again?
import pandas as pd
import matplotlib.pyplot as plt
# Load the space mission data from a CSV file
data = pd.read_csv('space_missions.csv')
# Convert the date column to datetime format
data['Date'] = pd.to_datetime(data['Date'])
# Extract year from the date column
data['Year'] = data['Date'].dt.year
# Group the data by year and count the number of missions
missions_per_year = data.groupby('Year').size()
# Plot the number of missions per year
plt.figure(figsize=(10, 6))
missions_per_year.plot(kind='line', marker='o')
plt.title('Number of Space Missions per Year')
plt.xlabel('Year')
plt.ylabel('Number of Missions')
plt.grid(True)
plt.show()
Give Feedback
What went well? What could be improved?
Reflection Time:
This is a place to journal your experience of completing this project. This will help you figure out how to improve as a developer.
Write down how you approached the project. What was hard, what was easy. How might you improve for the next project? What was your biggest learning from today? What would you do differently if you were to tackle this project again?
# with this exercise , I created a template for charting, its a bit long cut process but it teaches me and let me memorize the methods by repition: name_frequency = clean_data['Organisation'].value_counts().reset_index() name_frequency.columns = ['name', 'frequency'] name_frequency.shape name_frequency.head plt.figure(figsize=(14,8)) plt.xticks(fontsize=14, rotation=90) plt.yticks(fontsize=14) plt.xlabel('Organisation', fontsize=14) plt.ylabel('Number of Launch', fontsize=14) plt.title('Number of Launches per Company', fontsize=24) plt.bar(name_frequency.name[:24], name_frequency.frequency[:24]) # I also learned how to use plotly and matplotlib, and differenciate their pros and cons when using it
Give Feedback
What went well? What could be improved?